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Abstract 

Background: Pre-genomic and post-genomic studies demonstrate that chlamydiae actively recombine in vitro and 
in vivo, although the molecular and cellular biology of this process is not well understood. In this study, we 
determined the genome sequence of twelve Chlamydia trachomatis recombinants that were generated in vitro 
under antibiotic selection. These strains were used to explore the process of recombination in Chlamydia spp., 
including analysis of candidate recombination hotspots, and to correlate known C. trachomatis in vitro phenotypes 
with parental phenotypes and genotypes. 

Results: Each of the 190 examined recombination events was the product of homologous recombination, and no 
candidate targeting motifs were identified at recombination sites. There was a single deletion event in one 
recombinant progeny that resulted in the removal of 17.1 kilobases between two rRNA operons. There was no 
evidence for preference for any specific region of the chromosome for recombination, and analyses of a total of 
over 200 individual recombination events do not provide any support for recombination hotspots in vitro. Two 
measurable phenotypes were analyzed in these studies. First, the efficiency of attachment to host cells in the 
absence of centrifugation was examined, and this property segregated to regions of the chromosome that carry 
the polymorphic membrane protein (Pmp) genes. Second, the formation of secondary inclusions within cells varied 
among recombinant progeny, but this did not cleanly segregate to specific regions of the chromosome. 

Conclusions: These experiments examined the process of recombination in C. trachomatis and identified tools that 
can be used to associate phenotype with genotype in recombinant progeny. There were no data supporting the 
hypothesis that particular nucleotide sequences are preferentially used for recombination in vitro. Selected 
phenotypes can be segregated by analysis of recombination, and this technology may be useful in preliminary 
analysis of the relationship of genetic variation to phenotypic variation in the chlamydiae. 
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Background 

Chlamydia trachomatis is a Gram- negative obligate 
intracellular bacterium that is a leading cause of prevent- 
able blindness and sexually transmitted diseases world- 
wide [1]. Much of the biology of infection and disease 
remains unclear in this system, owing largely to the lack 
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of a routine genetic system for these organisms. While 
many aspects of these challenges have recently been 
overcome [2,3], the use of genetic transformation in this 
system is just beginning to be exploited. One aspect of 
chlamydial biology that is poorly understood involves 
the mechanism of lateral gene transfer among chlamy- 
dial strains both in the laboratory and, most likely, in pa- 
tients. Coinfection of host cells in vitro with chlamydial 
isolates encoding different drug resistance markers lead 
to generation of dual resistant recombinant progeny 
[4,5]. These results lend support to pre- and post-genomic 
analyses demonstrating that chlamydiae recombine in 
infected hosts [6-12], likely following infection of a patient 
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by multiple strains [13]. Chlamydia spp. encode no 
recognizable bacterial gene transfer systems, thus the 
mechanisms underlying chlamydial recombination re- 
main unknown. 

C. trachomatis and many other chlamydiae are differ- 
entiated into distinct serovars based on antibody specifi- 
city to the major outer membrane protein (MOMP or 
OmpA), encoded by ompA. Serovars and subserovars of 
C. trachomatis fall into three groups those associated 
with trachoma (serovars A, B, and C), those associated 
with non-invasive sexually transmitted infections of the 
urogenital tract (serovars D through K), and those asso- 
ciated with invasive lymphogranuloma (LGV; serovars 
LI to L3) [14]. This historical classification system has 
recently been modified to a genotypic characterization 
of strains, both by sequencing of ompA and the inclusion 
of a variety of other markers in the analysis [15-17]. 
Nevertheless, many of the biological differences among 
chlamydiae still can be grouped by the serovar-based 
classification scheme. Clinically relevant differences 
among the chlamydiae include host tropism, variation in 
disease outcome, and in vitro biology. With some excep- 
tions (reviewed in [18]), such as tryptophan utilization 
[19,20] and fusogenicity of inclusions [21], the relation- 
ship between genotype and phenotype is not clear 
in vitro and certainly not with regards to how the phe- 
notypes observed in cell culture relate to the disease po- 
tential of a particular strain. Two such phenotypes that 
are different among C. trachomatis strains include the 
historical difference among serovars regarding attach- 
ment and invasion in the presence or absence of cen- 
trifugation during the infectious process [22], and 
secondary inclusion formation by different chlamydial 
strains [23]. Deciphering the genetic basis of these and 
other phenotypes is complicated by the relatively primi- 
tive molecular genetic techniques that have been avail- 
able for studying chlamydial biology, although this 
situation is changing. 

In the present study, genetically mosaic recombinant 
strains from parents with differing cell culture pheno- 
types were generated in vitro, cloned by limiting dilu- 
tion, and subjected to complete genome sequence 
analysis. These strains, the parentals used in the crosses, 
and selected clinical isolates were used to investigate the 
process of chlamydial genetic exchange, and to develop 
and test a system for a primary examination of attach- 
ment and invasion as well as secondary inclusion forma- 
tion phenotypes in C. trachomatis. 

Results 

Generation of recombinant strains 

A collection of recombinant strains was generated using 
parent strains within serovars J, F, and L2 (Table 1, Figure 1). 
These included IncA-positive strains J/6276 and L2-434, 



and the IncA negative strain F(s)/70. In some cases, 
crosses involved two parents (i.e. crosses 1-6, 11,12); 
while in other cases three-way crosses were attempted (i.e. 
Table 1, crosses 7-10). Each of these crosses yielded 
expected inclusion fusion patterns, leading to cells 
infected with both IncA-positive strains forming fused in- 
clusions, while cells infected with either IncA-positive 
strain and the F(s)/70 IncA-negative strain having multiple 
inclusions per cell. Mixtures of all three of these strains 
led to cells having mixed inclusions containing the two 
IncA-positive strains, and a set of isolated inclusions 
containing the IncA-negative F(s)/70 strain (Figure 2). 
While progeny with each possible combination of two 
antibiotic resistance markers were routinely identified in 
the three-way crosses, no triply resistant strain was reco- 
vered in any experiment. Additionally, no single progeny 
strain had sequence at any informative position from each 
of the three parents in a three-way cross. Recombinant 
progeny were generated in crosses of both IncA-positive 
and IncA-negative parents, with no apparent difference in 
the rate of recovery of recombinants relative to the IncA 
status of the parent (not shown). 

Genome sequence analysis of recombinant strains 

The genomes of the twelve recombinant strains were se- 
quenced using Illumina paired-end technology (Figure 3). 
In all recombinant strains, the sequences surrounding 
the individual resistance markers were derived from the 
appropriate parent, supporting the conclusion that these 
were recombinant strains and not spontaneous mutants 
that emerged during the selection process. There was 
evidence of a single random mutation in one recombin- 
ant, strain RC-L2(s)/3. This mutation was a G (L2-434 
sequence) to A [RC-L2(s)/3] substitution at position 
293,505 (genome accession CP002676), resulting in an 
alanine to valine amino acid change in the protein pro- 
duct of CT258. This same mutation was identified in the 
RC-J(s)/122 genome, a progeny of a cross in which RC- 
L2(s)/3 was a parent. There was no other evidence of 
random base change in any other sequenced recombin- 
ant genome. 

With one exception, the exchange of DNA in each re- 
combination event yielded products consistent with clas- 
sical gene conversion or homologous recombination. 
The exception involves a recombination/deletion event 
involving the ribosomal operons which occurred in the 
cross between parental strains RC-L2(s)/3 and RC-J/ 
6276 tet yielding recombinant strain RC-J(s)/122 (Table 1, 
cross 12). This cross led to a deletion of one of the three 
ribosomal operons found in most of the in vitro-gener- 
ated Tet r C. trachomatis strains (Figure 1, [5]), returning 
the progeny strain to the number of ribosomal operons 
found in wild-type C. trachomatis and other closely 
related species (Figure 4). This event also led to the 



Jeffrey et al. BMC Microbiology 201 3, 13:142 
http://www.biomedcentral.com/1471 -21 80/1 3/1 42 



Page 3 of 14 



Table 1 Phenotypes of parents and progeny in recombinant crosses 



MIC (ug/ml) Phenotype 



Cross Progeny 


Parental strains 


OmpA 


Rif 


Ofl 


Tet 


Plasmid 


Inclusion fusion 


Attachment 


2° inclusion formation 


1 RC-J/6276tet 




J 


8 


0.5 


8 


- 


+ 


- 


2 




L2-434tet-13 


L2 


0.008 


16 


8 


+ 


+ 


+ 


1 




J/6276rif 


J 


8 


0.5 


0.032 


- 


+ 


- 


1 


2 RC-F(s)/342 


RC-J/6276tet-rif 


F 
J 


32 
8 


0.5 
0.5 


8 
8 


- 
- 


- 

+ 


- 
- 


N/A 
1 




F(s)/70rif 


F 


32 


0.5 


0.032 


- 


- 


- 


N/A 


3 RC-L2(s)/46 




L2 


32 


16 


0.032 


+ 


- 


+ 


N/A 




L2-434ofl 


L2 


0.008 


16 


0.032 


+ 


+ 


+ 


1 




F(s)70rif 


F 


32 


0.5 


0.032 


- 


- 


- 


N/A 


4 RC-F/69 




F 


32 


4 


0.032 


+ 


+ 


+ 


1 




L2-434ofl 


L2 


0.008 


16 


0.032 


+ 


+ 


+ 


1 




F(s)70rif 


F 


32 


0.5 


0.032 


- 


- 


- 


N/A 


5 RC-L2(s)/3 




L2 


32 


4 


0.032 


- 


- 


+ 


N/A 




L2-434ofl 


L2 


0.008 


16 


0.032 


+ 


+ 


+ 


1 




F(s)70rif 


F 


32 


0.5 


0.032 


- 


- 


- 


N/A 


6 RC-L2/55 




L2 


32 


4 


0.032 


- 


+ 


+ 


N/A 




L2-434ofl 


L2 


0.008 


16 


0.032 


+ 


+ 


+ 


1 




F(s)/70rif 


F 


32 


0.5 


0.032 


- 


- 


- 


N/A 


7 RC-J/953 




J 


8 


16 


0.032 


+ 


+ 


+ 


4 




L2-434ofl 


L2 


0.008 


16 


0.032 


+ 


+ 


+ 


1 




RC-F(s)/343tet-rif 


F 


32 


0.5 


8 


- 


- 


- 


N/A 




J/6276rifR 


J 


8 


0.5 


0.032 


- 


+ 


- 


1 


8 RC-J/943 




J 


8 


16 


0.032 


+ 


+ 


+ 


1 




L2-434/ofl 


L2 


0.008 


16 


0.032 


+ 


+ 


+ 


1 




RC-F(s)/343tet-rif 


F 


32 


0.5 


8 


- 


- 


- 


N/A 




J/6276rif 


J 


8 


0.05 


0.032 


- 


+ 


- 


1 


9 RC-J/966 




J 


8 


16 


0.032 


+ 


+ 


+ 


1 




L2-434/ofl 


L2 


0.008 


16 


0.032 


+ 


+ 


+ 


1 




RC-F(s)/343tet-rif 


F 


32 


0.5 


8 


- 


- 


- 


N/A 




J/6276rif 


J 


8 


0.5 


0.032 


- 


+ 


- 


1 


10 RC-L2/971 




J 


8 


16 


0.032 


+ 


+ 


+ 


4 




L2-434/ofl 


L2 


0.008 


16 


0.032 


+ 


+ 


+ 


1 




RC-F(s)/343tet-rif 


F 


32 


0.5 


8 








N/A 




J/6276rif 


J 


8 


0.5 


0.032 




+ 




1 


1 1 RC-F(s)/852 




F 


32 


4 


8 


+ 






N/A 




RC-F/69 


F 


32 


4 


0.032 


+ 


+ 


+ 


1 




RC-F(s)/343tet-rif 


F 


32 


0.5 


8 








N/A 


12 RC-J(s)/122 




J 


32 


4 


8 






+ 


N/A 




RC-L2(s)/3 


L2 


32 


4 


0.032 






+ 


N/A 




RC-J/6276tet-rif 


J 


8 


0.5 


8 




+ 




1 



Plasmid phenotype tests for presence and genotype of plasmid. 

Progeny strains used in subsequent recombinant crosses are shown in bold face. 
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^ ^-434of^ ^s)/70r^ 1 ^ ^/6276 te^ 1 ^ ^L2(s )/3^ 

T ▼ ▼ 



Figure 1 The genealogy of recombinant strains generated and explored in this study. The figure shows the parental strains used to 
generate recombinant strains. In each case, the parents in a cross are connected by a horizontal line, and the progeny are at the end of the 
arrow. In some cases, the progeny of one cross was used as a parent in a subsequent cross. Primary parental strain names includes drug 
resistance, and all recombinant strains (indicated by prefix RC- ) are both rifampicin and ofloxacin resistant. The colors used indicate the OmpA 
phenotype of each strain, as determined by fluorescence microscopy and genome sequence analysis. Strains containing the plasmid are shown 
in bold face and underlined. Crosses involving three parents are not shown because no triply drug resistant strains could be recovered. 



deletion of the C. trachomatis ORFs CT740-749, resulting 
in a progeny strain that contains only the C. suis homologs 
of CT740 through CT749. The results demonstrate that 
these C. suis sequences can complement any required 
function of the deleted C trachomatis genes for growth 
in vitro. 

Nucleotide sequence analysis of the recombinant 
genomes showed that some of these isolates lacked the 
chlamydial plasmid (Table 1, Figure 1). We originally hy- 
pothesized that loss of the plasmid was associated in some 
way with the recombination process. To explore this pos- 
sibility, PCR analyses were performed on all recom- 
binants, as well as the parents used in this study. Both the 
J/6276 rif and the F(s)/70 rif parents were negative for the 
plasmid, whereas the L2-434 ofl parent was plasmid- 
positive (Table 1, Figure 1). Because plasmid was absent in 
both the J/6276 rif and the F(s)/70 rif parents used in the 
crosses, plasmid loss in the resulting progeny was likely a 
function of stress associated with antibiotic-based selec- 



tion of strains prior to generating recombinants as opposed 
to a stress induced by the recombination process. 

The sequenced recombinant genomes allowed a com- 
parative survey of recombination events in progeny 
strains. The largest fragment that was laterally trans- 
ferred during recombination was 412,907 base pairs, 
found in RC-J(s)/122, while the smallest documented 
double crossover event was a 7 base pair fragment in the 
RC-L2(s)/3 strain. 

A total of 190 independent crossover regions were 
detected in the 12 recombinant strains. The distribution 
of these recombination sites was examined by mapping 
each crossover position from each of the 12 sequenced 
genomes to a single arbitrarily chosen F(s)/70 parental 
genome (Figure 5). There was generally a higher concen- 
tration of crossovers surrounding the rpoB locus (associ- 
ated with Rif resistance), and there were large regions of 
the chromosome that lacked evidence of recombination, 
such as the region surrounding CT001. 
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□ F(s)/70 



I I ribosomal operon □ Tet(C) island 



incA gyrA(of\) rpoB( rif) 



ompA 



RC-J/6276tet I 

rif R + tet R 

RC-F(s)/342 I 

rif R + tet R 

RC-L2(s)/46 

rif R + ofl R 

RC-F/69 

riP + ofl R 

RC-L2(s)/3 
rif R + ofl R 

RC-L2/55 

rif + ofl R 

RC-J/953 

rif + ofl R 

RC-J/943 

rif + ofl R 

RC-J/966 

rif + ofl R 

RC-L2/971 

rif R + ofl R 



J 82,162 



RC-J(s)/122 

rif R + ofl R + tet R 412,907 

Figure 3 Genome maps of recombinant strains, derived from complete nucleotide sequence analysis. The colors used in recombinant 
maps indicate the parental genotype, as is indicated at the top of the figure. The Tet(C) island is originally from C. suis R19. The approximate 
location of the genetic markers used in the construction of the recombinant genomes is shown above the RC-J/6276tet genome map. Below 
each strain name is the antibiotic resistance markers that the recombinant strain carries. The bracket and number below each genome map 
indicate the largest size of contiguous integrated DNA. The small brackets above each genome map indicate crossover regions that were 
confirmed by PCR amplification and Sanger sequencing. 
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■ L2-434 □F(s)70 ■ J/6276 □ Chlamydia suis 

□ tet(C) island □ ribosomal operon ■ crossover 

C. trachomatis 0. suis c. suis 

CT740-749 tet(C) island CT740-749 

RC-J/6276tet m^^M^^TH" I TT I 

RC-L2(s)/3 I ■ I I 

RC-J(s)/122 I A II I III 

Figure 4 Schematic diagram of the CT740 to CT749 regions in selected recombinant sequences. The colors used indicate the genotype of 
a given region. The ribosomal operons are shown in yellow, and crossover sites are shown in black. The deletion of the C. trachomatis homologous 
region of CT740 to CT749 in the RC-J(s)/1 22 sequence is indicated by the delta symbol. 



A recent report [24] indicated a strong preference for 
recombination at specific positions within trpB or gyrA 
in several recombinant progeny originally generated by 
Demars and Weinfurter [4]. We used two approaches to 
examine selected sets of candidate hotspots identified by 
these authors. First, we examined our original 12 recom- 
binant genomes for recombination events at common 
sites. While analysis of these fully sequenced recombin- 
ant strains identified four examples of recombination 
events that occurred within the same genetic region in 
independent progeny strains (Table 2, Figures 3 and 5), 
and none were found in more than 2 recombinant pro- 
geny. Second, we conducted PCR-based sequence analysis 
of a different set of completely independent recombinant 




Figure 5 The genomic location of crossover regions in each of 
the twelve sequenced recombinant progeny strains. The 

sequenced strain D/UW3Cx gene designations were used as the 
reference, with the location of gene CT001 indicated at the top 
of representative genome. The black tick marks indicates the 
location of a crossover region. The location of the plasticity zone 
is shown in gray. 

\ J 



crosses, using parental combinations (D/UW3Cx X LI/ 
440/LN; D/UW3Cx X L3/404/LN) that were nearly identi- 
cal to those analyzed by Srinivasan and colleagues [24]. In- 
dependence of these crosses was assured because each of 
the 14 examined progeny was the product of a fully inde- 
pendent cross of parental strains. In no examined case 
was there evidence for recombination at either of the loci 
identified by these authors, in any of the 14 progeny 
strains generated from these crosses (Table 3). 

Chi sites have previously been described as sites for 
homologous recombination in bacteria [25,26], and it is 
possible that a canonical chi site, or other sequence pat- 
tern, might be found at or near chlamydial recom- 
bination sites. The program MEME [27] was used to 
determine if any identified crossover sites were linked to 
a common sequence motif. These analyses support the 
hypothesis that recombination in vitro does not require 
specific target sequences and occurs at random sites 
across the genome. 

Genotypes associated with attachment efficiency 

Attachment efficiency in the presence or absence of 
centrifugation is a differentiating phenotype among C. 
trachomatis strains [22]. Strains of serovar L2 have a 
high rate of attachment in static culture, while the non- 
LGV serovars have a reduced ability to infect in the 
absence of centrifugation (Figure 6, [22]). We used a PCR- 
based analysis of attached EBs to examine the efficiency 
of attachment in our recombinant strains, relative to the 
parents of the crosses. Parental strains performed as pre- 
dicted in these assays, with our serovar L2 strain having 
little dependence on centrifugation for attachment, while 
centrifugation enhanced attachment by both the serovar F 
and Serovar J parental strains (Figure 6). However, the 
different recombinant progeny strains showed variability 
in attachment efficiency relative to ompA genotype, 
with individual progeny strains reflecting the attach- 
ment efficiency of either the Serovar L2 or serovar F/J 
parental strain. 
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Table 2 A comparison of shared crossover sites in different progeny strains 



Recombinant 


RC-L2(s)/3 


RC-F(s)/342 


Region of crossover 


CT778 {priA) 


CT778 (priA) 


Coordinates 


916870:917156 


954495 : 955597 


Comments 


F(s)70 - L2-434 hybrid CT778 


F(s)70 - J/6276 hybrid CT778 


Recombinant 


RC-L2(s)/3 


RC-J/966 


Region of crossover 


CT331 (dxs) and CT332 (pykF) 


CT332 {pykF) 


Coordinates 


377279 : 377995 


370626 : 37785 


Comments 


F(s)70 CT331, L2-434 CT332 


J/6276 - L2-434 hybrid CT332 


Recombinant 


RC-L2/971 


RC- J/966 


Region of crossover 


CT569 (gspG) and CT570 (gspF) 


CT569 (gspG) and CT570 (gspF) 


Coordinates 


634854:636140 


635246 : 636532 


Comments 


J/6276 CT569, L2-434 CT570 


J/6276 CT569, L2-434 CT570 


Recombinant 


RC-L2/971 


RC-L2/55 


Region of crossover 


CT585 {trpS) and CT586 {uvrB) 


CT586 (uvrB) 


Coordinates 


655362 : 656561 


656865 : 657292 


Comments 


L2-434 CT585, J/6276 CT586 


F(s)70 - L2-434 hybrid CT586 



A genome-wide association analysis was then used to 
determine if regions in the chlamydial genome could be 
associated with the observed attachment efficiency 
phenotype. Briefly, the sequenced recombinant genomes 
are aligned (12 recombinant strains and 3 parental 
strains), and every informative site (any position in the 
alignment where a different genotype is present) is ana- 
lyzed using the Fishers exact test to determine if that 
genotype is associated with observed phenotype. Five 



Table 3 Analysis of independent recombinant strains for 
recombination hot-spots 



Strain 


CT1 89 genotype 


CT315 genotype 


L3XDJ 


D 


L3 


L3xD_8 


D 


L3 


L3xD_9 


D 


L3 


L1xD_11 


D 


L1 


L1xD_12 


D 


L1 


L1xD_14 


D 


L1 


L1xD_15 


D 


L1 


L1xD_16 


D 


L1 


L1xD_17 


L1 


L1 


L1xD_18 


D 


L1 


L1xD_19 


D 


L1 


L1xD_20 


D 


L1 


L1xD_21 


L1 


L1 


L1xD_23 


D 


L1 



Individual recombinant progeny from independent crosses were subjected to 
PCR-based DNA sequencing and assessed for recombination at positions 
identified as hotspots by Srinivasan and colleagues [24]. For each sequenced 
product, the identified genotype at that region is indicated (D or L1/L3). There 
were no examples of recombination in any of these sequenced PCR products. 



genomic regions were identified that had the highest 
possible inverse Log j^-value based on sample size and each 
observation group size (Additional file 1: Figure SI). 
These include regions containing ORFs CT001 - CT089, 
CT139 - CT146, CT203 - CT215, CT740 - CT746, and 
CT783 - CT875. The predicted proteins encoded within 
these regions of the 3 parents and 12 recombinants were 
then compared using the MUSCLE sequence alignment 
software, and a total of 124 proteins had at least one non- 
synonymous amino acid change that was associated with 
the attachment phenotype (Additional file 2: Table SI). The 
chlamydial membrane proteins PmpE (14 amino acid 
changes), PmpF (110 AA changes), PmpG (28 AA changes), 
and PmpH (57 AA changes) were among the proteins with 
the highest number of non-synonymous amino acid 
changes. Other relevant genes that were associated with 
high attachment efficiency were ORFs CT089, and CT860 - 
862, ORFs encoding proteins involved in the Type III se- 
cretion process [28,29]. Differences in the sequences of 
proteins demonstrated by others to function in primary 
attachment (OmpA, [30], OmcB [31]) or proposed to be 
associated with very early events following contact 
(HSP70, [32]) were not associated with differential attach- 
ment efficiency, as measured by our assay (Figure 6). 

Variation in secondary inclusion formation between 
recombinant strains 

Formation of secondary inclusions in infected cells is an- 
other trait that varies among strains and serovars. For 
example, strains of serovars G and F commonly form 
secondary inclusions at a higher rate than strains of 
serovar J and L2 [23]. We explored the secondary inclu- 
sion phenotype of IncA-positive recombinant strains; 
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p < 0.005 



■ Parental strains 
I I Recombinant strains 



iin 



B 



w v) m at 

?T cm c5 SS 9> 9> 

J J J It - - 

6 6 6 6 

tc tc tc tc 



7 7 7 7 "? 
O O O O O 

ul on ol on cc u_ 



CO CO 



o 

Of 



4- 
o 
a: 



OmpA 
HSP70 
OmcB 
PmpA 
PmpB 
PmpC 
PmpD 
PmpE 
PmpF 
PmpG 
PmpH 
Pmpl 



■ ■ 



■ L2-434 ■ F(s)/70 ■ J/6276 

Figure 6 Attachment efficiency and subsequent genomic analysis of parental and progeny recombinant strains. Panel A: Measurement 
of the attachment efficiency for parental and recombinant strains. The specific strains analyzed are represented on the x-axis (center of figure), 
and the percent attachment efficiency is represented on the y-axis. Dark gray bars represent parental strains, and light gray bars represent 
recombinant strains. Panel B: The genotype of each strain for the 9 pmp genes and 3 other genes previously discussed as being associated with 
attachment are shown below each strain in graph. The colored boxes indicate the parental genotype of each gene, as indicated at the bottom of 
the figure. The pmp genes that are associated with attachment efficiency are indicated in bold. Boxes containing two colors indicate that a 
crossover event occurred within the gene in this strain. 



this analysis was not possible in strains that are IncA- 
negative, because our readout of secondary inclusions is 
dependent on antibodies to IncA. Of the eight IncA- 
positive recombinant strains tested, recombinants RC-J/ 
953 and RC-L2/971 showed extensive secondary inclu- 
sion production (Table 1, Figure 7). These results are sur- 
prising because both parental strains (J/6276 and L2-434) 
used to create RC- J/953 and RC-L2/971 are low secondary 
inclusion formers [23]. Recombinant progeny with high 
secondary inclusion phenotypes where both parents ex- 
hibit low secondary inclusion formation suggest a possible 
interaction between at least two chlamydial proteins, or at 
least two independent genetic markers, in the manifest- 
ation of the secondary inclusion phenotype. 

Quantitative analysis of possible loci associated with 
the secondary inclusion phenotype was inconclusive. 
This was a function of both the low number of 
recombinants available for analysis, and the fact the ap- 
parently multiple alleles are involved. 



Discussion 

Genetic manipulation of chlamydiae has been a historical 
barrier to research in this system, but some of these 
barriers have recently been removed. One such area of 
progress is the use and understanding of chlamydial 
recombination. There is considerable evidence for in vitro 
and in vivo recombination by chlamydiae, and the methods 
for generating chlamydial recombinants are becoming 
routine [4,5,24]. However, there remains a general lack 
of understanding regarding the cellular and molecular 
mechanisms associated with the process. The present study 
was initiated to address these challenges. We hypothesized 
that an investigation of both the process of genetic recom- 
bination in chlamydiae and the correlation of specific chla- 
mydial genotypes with phenotypes can be addressed using 
a combination of contemporary genome sequencing tech- 
nologies with our ability to create genetic recombinants 
among chlamydiae. This approach has also been used by 
Nguyen and colleagues [33] as part of a forward genetic 
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Figure 7 Fluorescent microscopic analysis of the secondary 
inclusion formation phenotype of recombinant strain RC-J/953. 

McCoy cells were infected at an MOI of -0.5, and images were 
taken 48 h post-infection. All cells were labeled with anti-lncA 
(green), and anti-OmpA (red), and DNA is labeled with DAPI (blue). A 
representative secondary inclusion is indicated by the white arrow in 
the bottom panel. The strain being analyzed is shown at the right of 
each image. Scale bar, 10 urn. 



strategy in these organisms, and the results of such experi- 
ments can be integrated with the recently developed chla- 
mydial transformation system [3] to develop and validate 
correlations between gene structure and protein function. 

Evidence for recombination in chlamydiae was first 
provided by nucleotide sequencing of genes or genomes 
taken from a variety of chlamydial strains. There are 
data in the literature suggesting that recombination 
hotspots might be present within or around ompA 
[7,11,12], and also at other locations in the genome [34]. 
Our genome sequencing has added some support for 
this premise, as the D(s)/2923 genome discussed by 
Jeffrey et al. [10] has a hybrid D/E OmpA sequence, and 
apparent recombination sites within this strain are at or 
very near sites seen in other, independently isolated, 
clinical strains [9,11]. Other investigators have proposed 
and debated the concept of chlamydial recombination 
hotspots using analysis of chlamydial genome sequences 
from laboratory-generated or clinical strains [8,24,35]. In 
the present study, we used two strategies to investigate 



the possible clustering of recombination events in vitro. 
First, we analyzed apparent crossover sites by genome 
sequencing of 12 recombinant genomes, which led to 
the identification of a total of 190 primary recombin- 
ation sites. The largest integrated fragment identified in 
these experiments was over 400,000 base pairs, which 
constitutes approximately 40% of the chlamydial gen- 
ome. The long recombined region observed in these 
progeny strains are consistent with the original obser- 
vations of Demars and Weinfurter [4], who discuss very 
large exchanges in their recombinants. Sequence data 
from clinical isolates do not provide evidence for such 
large exchanged fragments, but there is clear evidence 
of recombined regions of greater than 50,000 base 
pairs [6,10,35]. Chlamydial genomes are remarkably 
homogenous and syntenous, and we hypothesize that 
many recombination events that may have occurred, 
both in clinical strains and in laboratory-generated pro- 
geny, have been missed because of the low level of over- 
all genomic variability. 

Our data provided no evidence for increased frequency 
of particular recombination at specific sites surrounding 
markers used for selection (Figure 5). Certain areas of 
the genome were apparently devoid of recombination 
events, but these areas also were not physically linked to 
any of the selectable markers used for these studies. Our 
data provide no basis for these chromosomal sections 
being refractory to recombination. A total of four gen- 
omic locations were identified as possible recombination 
targets in more than one independent progeny clone. 
None of these four positions is identified as a recombi- 
nation hotspot in other studies [9]. No candidate hotspot 
regions within or immediately around ompA were identi- 
fied in any of our in vitro recombinants, and none of the 
positions are directly flanking the markers used for 
selection. 

A second approach to investigate chlamydial recom- 
bination hotspots was in response to work of Srinivasan 
et al. [24] who examined sequence data generated by 
Demars and Weinfurter [4], and identified candidate re- 
combination hotspots at several loci. We attempted to 
replicate these results by making completely indepen- 
dent recombinant clones using strains very similar to 
those used by these investigators, and examining pre- 
dicted loci for evidence of recombination. These clones 
were determined to be fully independent, because each 
was derived from a completely independent primary 
mixture of parental strains. We found no evidence of the 
use of recombination sites identified by Srinivasan and 
colleagues in any of the clones. Our inability to identify 
any hotspots surrounding previously identified recom- 
bination sites leads us to propose that most previously 
identified recombination hotspots were identified as 
such because: 1) there was significant in vivo selection 
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pressure for change at a locus (i.e. intra-OmpA or Pmp 
antigenic variation), or 2) the position being analyzed is 
identified because there simply was more sequence het- 
erogeneity in that region of the chromosome, or 3) the 
in vitro progeny identified as containing recombination 
hotspots were siblings in a single recombination event 
prior to being cloned out of a population. 

Each recombination event identified appeared to be a 
product of homologous recombination or gene conver- 
sion between highly related sequences. There was a sin- 
gle deletion event in one progeny strain, in which two 
virtually identical rRNA sequences were precisely de- 
leted to yield a single rRNA operon, with 17 kB of inter- 
vening sequences (10 genes, CT740 through CT749) 
removed in the process [RC-J(s)/122, Figure 4]. This was 
the only example of a deletion in any progeny strain, 
and there were no cases of a duplication event. These re- 
sults are consistent with the general sequence similarity 
and synteny found in the naturally mosaic C. trachomatis 
genomes, with occasional phenotype-associated deletions 
in specific regions of the chromosome [36,37]. 

The three-way crosses were designed to examine the 
possibility that multiple parents could be involved in 
generation of a single recombinant progeny. We saw no 
evidence of a three-way cross in any of our selection ex- 
periments or in any genome sequence analysis, even 
though multiple independent two-way crosses were re- 
covered from those experiments. If the probability of a 
three-way event is a function of the probability of two 
independent recombination events, it is likely that not 
enough individual recombinants were screened to iden- 
tify an extremely rare progeny clone. There is, however, 
one issue that is addressed by the absence of any evi- 
dence for contribution of three parents in a cross. In 
many of the recombinants identified by our group and 
in studies by Demars and colleagues [4,38], multiple 
fragments from each parental genome are found in a re- 
combinant progeny, often in regions of the chromosome 
that were not selected for with the tested antibiotics 
(Figures 3 and 5). It is possible that these differently 
recombined fragments involve sequential and independ- 
ent recombination events occurring during the mixed 
infections used in this procedure. If involvement of mul- 
tiple chlamydiae was a common occurrence in the gen- 
eration of a cross, we hypothesized that some progeny 
from the three-way crosses should carry fragments of 
each parent. As no single progeny strain was identified 
with fragments of each of the parents in the three-way 
cross, our results do not support this hypothesis. There- 
fore our current model is that the generation of recom- 
binant progeny is the result of a single exchange event 
between two parents, and that these exchanges can in- 
volve very large fragments of the chromosomal DNA. 
This latter result is consistent with analyses by other 



laboratories [4,9,33,35,38]. Subsequent recombination 
events will then lead to differential integration of frag- 
ments of the exchanged DNA, leading to the mosaicism 
seen in many of the recombinants. 

The attachment efficiency in the absence of centrifuga- 
tion measured for the different recombinants revealed 
groups having either a high attachment efficiency, as 
exhibited by LGV strains, or a low attachment efficiency, 
as exhibited by non-LGV urogenital strains (Figure 6). 
Genome wide association analysis of this phenotype re- 
vealed a number of loci that were quantitatively associ- 
ated with the attachment efficiency phenotype seen in 
cell culture. While the list of candidate alleles that might 
be associated with this phenotype includes a wide varie- 
ty of genes (i.e. type III secretion -associated ORFs 
[28,29]), we focus this discussion on proteins known or 
hypothesized to be on the surface of the chlamydial 
elementary body. One collection of proteins that is 
highly associated with attachment efficiency were six of 
the Pmp proteins (PmpDEFGHI), a group of chlamydia- 
specific auto transporters, some of which are known to 
be surface exposed and are logical mediators of attach- 
ment to cells [39]. A bioinformatic analysis of Pmp se- 
quence and structure demonstrates that four of these 
encoded Pmps (PmpEFGH) vary consistently in relation- 
ship to the described phenotype, and these changes include 
alterations of the net negative charge of the Pmp protein 
(Additional file 2: Tables S1-S2 and Additional file 3: 
Table S2). It is, however, preliminary to assess any pro- 
perty of a single protein or small set of proteins with the 
attachment efficiency distinction among strains. The re- 
cently developed genetic transformation system will be 
a critical technology in directly assessing such relation- 
ships in this species [3]. 

The second phenotype investigated in our study was 
the formation of secondary inclusions within infected 
cells. This property of C. trachomatis strains varies not 
only between C. trachomatis serovars, but also between 
strains within serovars [23]. An intriguing result was the 
identification of high secondary inclusion formers in 
crosses between parents that exhibited very low second- 
ary inclusion formation phenotypes (Table 1, Figure 7). 
While interpretations of this result are preliminary, it 
appears that the phenotype is associated with two or 
more regions of the genome, and that a specific combi- 
nation of genotypes at these positions is required for 
the high secondary inclusion formation phenotype to 
be manifested. 

Continued examination of novel recombinants, includ- 
ing backcrosses to integrate more parental genome into 
recombinant strains will add clarity to the phenotypes 
we have discussed. We also continue to use the re- 
combinants as tools to understand the basic processes 
associated with genetic exchange in the chlamydiae. 
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Conclusion 

The described experiments characterize in detail the 
products of genetic exchange by C. trachomatis in vitro. 
Sequences representing over 1/3 of the chlamydial 
chromosome can be incorporated during these crosses. 
Selected phenotypes can be segregated in these crosses. 
This approach can be combined with the novel DNA 
transformation technologies being developed in these 
bacteria, leading to novel approaches for determining 
the relationship between genetic makeup and chlamydial 
phenotype, both in vitro and in vivo. 

Methods 

Chlamydial strains and selection for resistance 

Antibiotic resistant C. trachomatis strains J/6276 nf , RC-J 
/6276 tet ' rif , F(s)/70 rif , F(s)/70 tet ' rif L2-434 ofl ,DUW/3Cx ofl , 
Ll/440/LN rif or L3/404/LN rif were generated as previously 
described [5]. Briefly, strains were grown in McCoy cells 
at a multiplicity of infection (MOI) of 1 in media 
containing sub-inhibitory concentrations, equivalent to 
half the minimum inhibitory concentration (MIC) of the 
appropriate drug. Serial passages of these strains were cul- 
tured in the media containing desired antibiotics until re- 
sistant mutants emerged or until passage was completely 
negative. Some strains required several attempts until 
resistant mutants were isolated. Isolates were then cloned 
by a twofold limiting dilution method. The resulting 
cloned elementary bodies (EBs) were grown to high titers 
and were partially purified by centrifugation of lysates of 
infected cells through a 30% MD-Gastroview® pad 
(Mallinckrodt Inc. St Louis). 

Generation of recombinant clones for complete genome 
sequence analysis 

Recombinants isolated for genome analysis were gener- 
ated from two sets of crosses (Table 1). The first of these 
involved two parental strains; L2-434 ofl and F(s)/70 rif 
and the second was a three-way cross with the parental 
strains F(s)/70 tet rif , J/6276 rif and L2-434 ofl . Recombination 
experiments were conducted as previously described [5]. 
Briefly, crosses were performed in McCoy cells seeded in 
sets of individual shell vials. The monolayers were then 
infected with different combinations of drug-resistant 
strains each at an MOI = 2, ensuring infections of cells 
with both strains. Cultures were incubated for 48 h post- 
infection in the absence of antibiotics and were then de- 
tached and lysed using a -80C/37C freeze-thaw cycle [5]. 
Potential recombinants were selected by inoculating 50 \A 
of the freeze-thaw lysates from each shell vial onto a new 
shell vial monolayer and overlaying with a medium 
containing antibiotics at 1/4 the MIC for each resistant 
parental strain. In the case of the three-way cross [F(s), J, 
L2], three different combinations of drug were applied to 
the infected monolayers (MOI = 2). These combinations 



included ofloxacin/rifampicin, ofloxacin/tetracycline, and 
ofloxacin/rifampicin/tetracycline. 

Generation of recombinant chlamydial strains for analysis 
of recombination hot spots 

Multiple independent shell vials containing confluent 
McCoy cells were inoculated sequentially with ofloxacin- 
resistant D/UW3Cx and rifampin-resistant L1/440/LN 
or L3/404/LN strains, and incubated 48 h in medium 
lacking antibiotics. Monolayers were lysed and used as 
inocula onto fresh McCoy cells at MOI = 1, and incu- 
bated in the presence of 4X the MIC of the drugs used 
for selection, rifampin and ofloxacin. These concentra- 
tions were previously determined to be sufficient to se- 
lect for individual recombinant strains resistant to both 
drugs. Incubation of either parent in this combination and 
concentration of antibiotics at MOI = 1 never yielded a 
doubly resistant mutant parent. Chlamydial recombinants 
growing in this mixture of antibiotics were propagated 
and cloned by limiting dilution. Only a single recombinant 
progeny was collected from each lineage from a single 
original inoculated shell vial. DNA was harvested from 
these clones, and PCR primers were created that flanked 
regions of suspected recombination hotspots identified by 
Srinivasan and colleagues [24]. The Phusion high fidelity 
DNA polymerase (New England Biolabs, Ipswich, MA) 
was used to generate PCR products from these regions, 
and these were sequenced at the Oregon State University 
Center for Genomics Research and Biocomputing. Se- 
quences were then examined for possible recombination 
by comparing informative sequence polymorphisms rela- 
tive to each parent in the cross. 

Chlamydia recombinant strain genomic DNA preparation 

Recombinants were clonally isolated using limiting dilu- 
tion and EB purification was conducted as previously de- 
scribed [23,40]. Purified EBs were incubated for 60 min 
with 4 units/mL RQ1 DNase (Promega) followed by treat- 
ment with 2 mM EGTA (RQ1 Stop solution, Promega) 
to inactivate the DNase. Elementary bodies were then 
suspended in Qiagen Genomic buffer Bl supplemented 
with dithiothreitol (5 mM) and DNA was then extracted 
using the Qiagen Genomic Tip kit, (Qiagen, Valencia, CA) 
following the manufacturer s instructions. 

Genome sequencing and sequence analysis 

Genomic DNA from recombinant strains was processed 
for Illumina-based paired-end sequencing using com- 
mercial DNA preparation kits (Alumina Inc., San Diego, 
CA) following the manufacturers instructions. Each 
recombinant genome was first assembled using the 
reference-guided assembly program Maq [41]. Appropri- 
ate parental genomes were used as references in the ana- 
lyses. Regions in reference-guided assembled genomes 
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where Maq could not resolve sequence were then com- 
pared to contiguous sequences assembled using de-novo 
assembly software Velvet [42] and a single contiguous 
draft sequence was produced. 

To confirm the clonality of the recombinant genomes, 
and to quality control our assembly process, two to four 
apparent crossover regions in each recombinant progeny 
were amplified by PCR and sequenced using classical 
Sanger sequencing. In all cases the sequenced amplicon 
contained the appropriate informative sites from each 
parent involved in the cross (not shown). 

Recombinant maps of each genome were produced by 
computationally parsing a draft genome against the two 
parents used to generate the recombinant, using the 
alignment program MAFFT with the default settings 
[43,44]. Any detected crossover regions were manually 
analyzed using MacVector sequence analysis software 
(Cary, NC). Crossover regions were defined as the inter- 
vening homologous sequence between two informative 
sites (defined as a nucleotide position that varied in se- 
quence between the two parent genomes), where the in- 
formative site was the same as one parent at one position 
and the same as the second parent at an immediately adja- 
cent informative site. 

Whole genome alignments including all recombinant 
strains and the 3 parental strains were constructed using 
MAFFT with default settings. Any position in this align- 
ment where at least one genome had a variable base was 
further analyzed using the Fisher exact test as a metric 
to determine if the variable genotype could be associated 
with a given phenotype. In these analyses, a low p- value 
indicated an association between the base sequence and 
a specific parental phenotype or genotype. A variable 
genotype was considered to be associated with a given 
phenotype if the calculated ^-value was the lowest pos- 
sible based on the sample size. The ^-values calculated 
by the Fisher s exact test were inverse Log-transformed 
and plotted using the statistical analysis program R 
(http://www.r-project.org/). 

Whole genome alignments of all recombinants against 
both parents were used to determine if any random mu- 
tations had occurred during culture and the generation 
of recombinants. A random mutation was defined as a 
nucleotide in the recombinant sequence that was diffe- 
rent than the nucleotide of either parent at the same nu- 
cleotide position. All ORF designations are based on 
numbering system used for the C. trachomatis D/UW3 
genome sequence [31]. 

Measurement of attachment efficiency 

McCoy cell monolayers were seeded in duplicate 24 well 
plates at 90% confluency, and triplicate wells of each 
plate were infected using a target MOI = of 1. Plates 
were then either centrifuged at 640 x g (2000 RPM on 



Beckman Coulter, Allegra X-15R centrifuge) for 1 h or 
placed on a rocker platform for 1 h, with both treat- 
ments being at room temperature. Wells were then 
washed 3 times with Hanks balanced salt solution and 
DNA was extracted directly from each well using the 
Qiagen DNeasy Blood and Tissue kit, with the lysis buf- 
fer supplemented with 5 mM dithiothreitol. Each sample 
was pipetted up and down 10 times to disrupt both host 
cells and chlamydiae. Genome copy number was deter- 
mined for each treatment by qPCR, using a probe for 
hsp60 (groEL_3, CT755). A cloned and quantified ver- 
sion of CT755 was used as the standard curve on all 
qPCR plates, ensuring that each sample being analyzed 
was properly quantified. The target sequence for this 
assay is conserved among C. trachomatis, but was 
unique to this hsp60 allele, as demonstrated by PCR 
analysis of alternate hsp60 open reading frames (CT110 
and CT604; not shown). Attachment efficiency was then 
calculated by dividing the genome copy number of the 
rocked samples by the genome copy number of the 
centrifuged samples. 

Quantification of secondary inclusion formation 

The frequency of secondary inclusion formation in par- 
ental and progeny strains was determined using previ- 
ously described methods [23]. Briefly, McCoy cells were 
infected with the strain of interest at an MOI = of 0.3. 
These cells were incubated for approximately 24 hpi 
prior to fixation with methanol. C. trachomatis IncA was 
labeled with mouse monoclonal anti-IncA, and chlamydial 
developmental forms were labeled with mouse anti- 
lipopolysaccharide [23]. Cells were then labeled with ap- 
propriate secondary antibodies (Southern Biotechnology 
Associates, Birmingham, AL) and observed using 400 x or 
lOOOx magnification. A semi-quantitative measure of 
secondary inclusion formation was conducted by deter- 
mining the fraction of infected cells having secondary 
inclusions versus the total number of infected cells. A 
1+ to 4+ scoring system was used to quantify secondary 
inclusion formation and each score was determined on 
three independent sets of coverslips. A 1+ value is 
equivalent to less than 10% of infected cells having sec- 
ondary inclusions, while a 4+ value is equivalent to 40% 
or more of the cells having secondary inclusions. 

Genome sequence accession numbers 

The genome sequences of the parental strains used to 
generate recombinant sequences and the previously se- 
quenced C. trachomatis strains used in the whole genome 
alignment studies are in the DDBJ/EMBL/GenBank data- 
base under the following accession numbers: D/UW3Cx, 
AE001273; L2-434Bu, AM884176; L2/UCH1, AM884177; 
L1/440/LN, HE601950; L3/404/LN, HE601955; D(s)/2923, 
ACFJ01000001; E/11023, CP001890; E/150, CP001886; Gl 
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9768, CP001887; G/11074, CP001889; G/11222, CP0018 
88; F/70, ABYF01000001; F(s)/70, ABYG01000001; J/6276, 
ABYD01000001; J(s)/6276, ABYE01000001. 

The C. trachomatis genome accession numbers of the 
recombinants used in this study have been deposited in 
the DDBJ/EMBL/GenBank database under the following 
accession numbers: RC-F/69, CP002671; RC-L2(s)/46, 
CP002672; RC-F(s)/852, CP002673; RC-J/943, CP002674; 
RC-J/953, CP002675; RC-L2(s)/3, CP002676; RC-F(s)/342, 
CP002677; RC-J(s)/122, CP002678; RC-J/966, CP002679; 
J/6276tetl, CP002680; RC-L2/971, CP002681; RC-L2/55, 
CP002682. 
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